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Sampling from Gibbs distribution is a central problem in computer science as well as in statistical 
physics. In this work we focus on the k-colouring model and the hard-core model with fugacity A 
when the underlying graph is an instance of Erdos-Renyi random graph G(n,p), where p = d/n 
and d is fixed. 

We use the Markov Chain Monte Carlo method for sampling from the aforementioned distribu- 
tions. In particular, we consider Glauber (block) dynamics. We show a dramatic improvement on 
the bounds for rapid mixing in terms of the number of colours and the fugacity for the correspond- 
ing models. For both models the bounds we get are only within small constant factors from the 
conjectured ones by the statistical physicists. 

We use Path Coupling to show rapid mixing. For k and A in the range of our interest the technical 
fi ■ challenge is to cope with the high degree vertices, i.e. vertices of degree much larger than the 

expected degree d. The usual approach to this problem is to consider block updates rather than 
single vertex updates for the Markov chain. Taking appropriately defined blocks the effect of high 
degree vertices somehow diminishes. However, devising such a construction of blocks is a highly 

V^JQ ■ non trivial task. 

^sO ' We develop for a first time a weighting schema for the paths of the underlying graph. Vertices 

which belong to "light" paths, only, can be placed at the boundaries of the blocks. Then the tree-like 
local structure of G(n, d/n) allows the construction of simple structured blocks. 



(N 



Interestingly enough, the weighting schema captures so well the desired properties of the blocks 
that we do not even need to argue explicitly on spatial mixing of Gibbs distribution for the range 
of k, A we consider. However, we do believe that our approach has further consequences as far as 
correlation decay is regarded. Thus, apart from the standard path coupling analysis, the main bulk 
of the analysis we use is of probabilistic nature. 
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1 Introduction 

Sampling from Gibbs distribution is a central problem in computer science as well as in statistical 
physics. Examples include sampling from the uniform (or a weighted) distribution over combinatorial 
structures like /c-colourings, independent sets, matchings of a given graph G. In this work we focus on 
colourings and independent sets when the underlying graph is an instance of Erdos-Renyi random graph 
G(n,p), where p = d/n and d is 'large' but remains bounded as n — >■ oo. We say that an event occurs 
with high probability (w.h.p.) if the probability of the event to occur tends to 1 as n — > oo. 

For this kind of problems the most powerful algorithms and somehow the most natural ones are based 
on the Markov Chain Monte Carlo (MCMC) method. The setup is an ergodic, time-reversible Markov 
chain over the /c-colourings (or independent sets) of the underlying graph. The updates guarantee that 
the equilibrium distribution of the chain is the desired one. Here we use standard Glauber block updates 
in the course of this paper we refer to the chains as Glauber dynamics. The main technical challenge is 
to establish that the underlying Markov chain has rapid mixing, i.e. it converges sufficiently fast to the 
equilibrium distribution (see ll9l [T6llT5l ). 

Given the input graph G(n,d/n), the focus is on two distributions. The first one is the colouring 
model, i.e. the uniform distribution over the /c-colourings of the input graph. The second one is the 
hard-core model with fugacity A, i.e. the independent set a is selected with probability proportional to 
\' a \ . The parameters of interest are the number of colours k and the fugacity A. The aim is to show rapid 
mixing for k as small as possible and A as large as possible for the corresponding models. 

For MCMC algorithms to converge, typically, the bounds for both k and A are expressed in terms of 
the maximum degree of the underlying graph. Examples of such bounds are ||5] [10] [TT] [13] US ED Ell 
for colouring and ||6] [7] [20] |25j for independent sets. In that terms, what makes the case of G(n, d/n) 
special is the (relatively) big fluctuations in the degree of the vertices. To be more specific, w.h.p. the 
vast majority of vertices in G(n, d/n) are of degree close to d, while the maximum degree is as huge as 
( inh^n ) • m sucn a situation, it is natural to expect that the rapid mixing bounds for both k, A depend 
on the expected degree d, rather than maximum degree. 

Sophisticated but mathematically non rigorous arguments from statistical physics (e.g. in |[T8l ) sup- 
port this picture. They suggest that w.h.p. over the instances of G(n, d/n) the Glauber (block) dynamics 
on /c-colouring has rapid mixing for any k > d. Furthermore, for k < d the chain is expected to be non- 
ergodic and only weaker notions of convergence hold. To our knowledge, there are no predictions for 
the fugacity as far as the hard-core model is concerned. However, using the result in [17] and standard 

id— l) d ~ 1 

arguments we could conjecture that we have rapid mixing as long as A < - (d _L d ~ % 

The best bounds for Monte Carlo sampling appear in |[22l (which improved on (H). The authors in 
11221 provide for a first time rapid mixing bounds for k and A which depend on the expected degree d. 
That is, w.h.p. over G(n, d/n) there are functions f(d) and h(d) such that Glauber dynamics has rapid 
mixing for /c-colourings and hard-core for k > f(d) and A < h(d), respectiveljlJ. However, the values 
for k and A that are allowed there are many orders of magnitude off the conjectured bounds. Here we 
improve on these bounds dramatically. We show that w.h.p. over the underlying graph G(n, d/n) we 
have rapid mixing for k > U-d and for A < ^rrg. That is, we approach the conjectured bounds for 
rapid mixing only within small constants. 

Remark. The main strength of MCMC approximate sampling algorithms relies on the fact that the 
configuration space should be, somehow, "well-connected". Rapid mixing allows approximating the 
target Gibbs distribution within distance err by letting the algorithm run for m(err _1 ) x poly(n) steps. 
There can be approximate sampling algorithms based on weaker assumptions. In O the author of this 
work proposed such an algorithm for approximate sampling /c-colourings of G(n, d/n) which has a 



'Even though these functions are not given explicitly it is conceivable from the analysis that it holds that f(d) > d c and 
h{d) < dT c ' for fixed c, c' > 0. 



notable performance in terms of minimum k, as it requires k > (2 + e)d. This algorithm is not a Monte 
Carlo one. Of course the approximation guarantees there are weaker. That is, the error of the output is a 
vanishing function of the size of the graph n, i.e. for a fixed sized graph the error is fixed. On the other 
hand, for the algorithms here the error can be arbitrarily small by allowing the algorithm to run for a 
sufficiently large number of steps. Furthermore, the approach we follow here applies to models other 
than colouring, e.g. hard-core model. 

We use the well-known Path Coupling technique, from f3], to show rapid mixing. Path Coupling is also 
used in both of the previous papers on the problem, i.e. H E3- For k and A in the range of our interest 
the technical challenge is to cope with the high degree vertices, i.e. vertices of degree much larger than 
d. The natural approach is to consider block updates rather than single vertex updates for the Markov 
chain. In particular we use the observation that the effect of high degree vertices somehow diminishes 
when they are away from the boundary of their block. Devising such a block construction is a highly 
complex task. We introduce for a first time a weighting schema for the paths of the underlying graph 
which allows a desired block construction. 

To be more concrete, we use our weighting schema to assign weight to each path in G(n, d/n). 
These weights allow to distinguish which vertices can be used for the boundaries of the blocks. We call 
such vertices as break-points. A break point should have all the paths emanating from it of sufficiently 
small weight. It turns out that w.h.p. there is a plethora of break-points in G(n,d/n). This allows 
creating small, simple structured blocks. For further discussion on the weighting schema see Section 

Remark. In the full version of this work we study some extra models which are of main interest in 
statistical physics, e.g the Ising model and the Potts model with positive temperature. 

Notation We use small letters of the greek alphabet to indicate colourings or independent sets, e.g. a, 
t. Also, by a(v) we indicate the assignment of the vertex v under the configuration a. For a vertex set 
B we call (outer) boundary of B the vertices outside B which are adjacent to some vertex inside B. 

1.1 The Models and the Algorithm 

We consider the colouring model and the hard core model. For each of these two models we consider a 
graph G = (V, E) and a set of spins C. We define a configuration space Q, C C . Given Q, the model 
specifies a distribution \x : Q — > [0, 1] . This distribution is usually called Gibbs distribution. 

Colouring Model. Given a graph G = (V, E) and a sufficiently large integer k, the colouring model 
specifies the following: The configuration space Q, is the proper /c-colourings of G. The Gibbs distribu- 
tion is the uniform distribution over Q. That is, for a G U it holds that 

^ = w 

Hard Core Model. Given G = (V, E) and a real A > 0, the hard core model with fugacity A is as 
follows: The configuration space Q is all the independent sets of G. The Gibbs distribution specifies 
that for each ae!], fi(a) is proportional to \\ a \, where |cr| is the cardinality of a. That is 

where Z is a normalizing quantity, i.e. Z = X^ CT( =n ^ • ^ * s usuai ly called partition function. 

In this work we propose a Markov Chain Monte Carlo algorithm for sampling from the two models 
above. The general form of the algorithm is as follows: 



Algorithm: The input is a graph G = (V, E), a number err > which is the error in the distribution 
of the output sample and a set of parameters p which specify the target Gibbs distribution p. 

First, the algorithm partitions the set of vertices V into an appropriate set of blocks B. Given B, it 
simulates the following Markov chain and returns the configuration of the chain after T = T(err) steps. 

• Start from an arbitrary configuration. 

• At each transition, the chain chooses uniformly at random (u.a.r.) a block B € B. If X t is the 
current state of the chain, then the next one, Xt+i, is acquired as follows: 

- For every vertex u £ B set X t+ i(u) = X t {u). 

- Set X t +\(B) according to Gibbs distribution conditional that the spins outside B are set 

Xt+i(V\B). 

Remark. Given a set of technical conditions known as ergodicity, it is trivial to show that the above 
chain converges to the appropriate Gibbs distribution jx. 

To deal with the time efficiency and the accuracy of our algorithm we need to address the following 
issues: 

1. The construction of B is done in polynomial time. 

2. There is an algorithm that gives in polynomial time an initial configuration of the chain. 

3. Each transition of the chain can be implemented in polynomial time. 

4. The chain converges to stationarity sufficiently fast, i.e. we have rapid mixing. 

We use mixing time, r m i x , as a measure of the speed of convergence of Markov chains. The mixing 
time is defined as the number of transitions needed in order to guarantee that the chain starting from an 
arbitrary configuration, is within total variation distance 1/e from the stationary distribution (see |pT9l ). 

Remark. In our context, we say that a Markov chain is rapidly mixing if r m j x is polynomial in n. 

Remark. It is not hard to see that the number of transitions that are required to get within error err from 
the stationary distribution is T(err) > In (^z) x T m i x . 

1.2 Our Results 

The main result of this work is in the following theorem. 

Theorem 1 Let d be sufficiently large and let e > be fixed. On input G(n, d/n) and sufficiently small 
err > Ofor the algorithm in Section\LT\the following holds: 

colouring model: W.h.p. over the input graphs G(n,d/n) and for any k > -^-d the algorithm re- 
turns a k-colouring of the input graph distributed within total variation distance err from Gibbs 
distribution. The time complexity scales as In (— ) X 0(n c ),for a fixed c > 0. 

hard-core model: W.h.p. over the input graphs G(n, d/n) and for any A < -^j- the algorithm returns 
an independent set of the input graph distributed within total variation distance err from Gibbs 
distribution. The time complexity scales as In ( — ) x 0{n c ), for a fixed d > 0. 



"e.g. number of colours k for colouring, fugacity for hard core 



Theorem[T]foilows as a corollary from a sequence of results that we present in this section. First, we need 
to remark the following: Up to this point we have not provided any details about the set of blocks B the 
Markov chain uses. These details are a bit technical. However, what is important, for the moment, is that 
each of the blocks in B should satisfy certain conditions which depend on the structure of the underlying 
graph G. In the following two theorems, the convergence holds w.h.p. over the instances of G(n, d/n) 
because (among other restrictions) the underlying graph should admit the appropriate partition B 

Theorem 2 Let M c be the Markov chain on the k-colourings of G(n, d/n) we define in Section 17.71 
With probability 1 — o(l) over the graph instances G(n, d/n) and for k > ^d, the mixing time of \M C 
is O(nlnn). 

Theorem 3 Let Mhc b e tne Markov chain of the hard core model on G(n,d/n) with fugacity A, we 
define in Section 1X71 Let e > be fixed. With probability 1 — o(l) over the graph instances G(n, d/n) 
and for A < -^r the mixing time of Mhc is O(nlnn). 

The following result, Theorem HJ shows that w.h.p. we can construct the set B efficiently. Also, it shows 
that w.h.p. the structure of each block in B is quite simple. The proof of Theorem @] appears in Section 
|2j once we have described in full detail how do we get the set B. 

Theorem 4 With probability 1 — o(l) over the instances ofG(n, d/n) the following is true: There is a 
small fixed s > such that the block construction can be made in time 0{n s ). Each block in B is either 
a tree or a unicyclic graph. 

Theorem |4] implies that, w.h.p. the set B is such that the updates in both chains {M. c and M.hc) can be 
implemented efficiently by using dynamic programming (D.P.). The method is standard and it is based 
on reducing the update to computing the so called partition function for a block B with fixed boundary. 
That is we use D.P. to compute the partition function for each block B. The extra edge in the unicyclic 
blocks should not pose any particular problem in the D.P.I5 

Finally, for the initial state of the chain we work as follows: For Mhc we can trivially consider 
the empty independent set as the initial state. For the chain M. c we can get an initial state by using 
the algorithm suggested in [12]. The authors there provide a greedy, polynomial time algorithm which 
/c-colours typical instances of G(n, d/n) for any k > (1 + c) In d/d and any fixed c > 0. 

1.3 Convergence - Proof technique 

We show rapid mixing by using the well-known Path Coupling technique in [31. The technique goes as 
follows: W.l.o.g. we consider the colouring model. Assume that the underlying graph G = (V, E) is of 
maximum degree A and, for the moment, let k > A. Finally, assume that we have the Markov chain M 
on the £>colourings of G with single vertex updates. 

Consider any two copies of M at state Xq,Yq, respectively. We take Xq, Yo so that they have exactly 
one disagreement, i.e. their Hamming distance H(Xq,Yq) is equal to 1. The coupling carries out one 
transition of each copy of M. Let X\ , Y\ be the colouring after each transition, respectively. A sufficient 
condition for rapid mixing is to exist a coupling for the transitions of the two copies of M such that 

E[H(x 1 ,Y 1 )\x ,Y }<i-&(n- 1 ). (i) 

To study the technique further, assume now that for w G V we have Xq{w) ^ Y(w). It is natural to 
use a coupling that updates the same vertex in both copies. The cases that matter are only those where 
the coupling chooses to update either the disagreeing vertex w or one of its neighbours. If the update 



3 For a more detailed treatment on how the update should be, the reader is referred to 2). 



involves the vertex w, then we get that X\ = Y\. This happens with probability 1/n, where \V\ = n. 
On the other hand, if the update involves a neighbour of w, then X\ , Y\ may have an extra disagreement. 
In particular, the update of a neighbour of w can generate an extra disagreement with probability at 
most jrr& ■ Since the disagreeing vertex w has at most A neighbours, the probability of having an extra 
disagreement is at most — jtta- For k > 2 A + 1, it is direct that CQ) is satisfied. 

W.h.p. G(n, d/n) is of maximum degree ( J" n ). That is, a vanilla path coupling would require 
an unbounded number of colours. Otherwise, i.e. if k is smaller than the maximum degree, there is no 
control on the expected number of disagreements generated in the coupling. However, it is possible to 
gain some control on the expected number of disagreements by using (appropriate) block updates rather 
than single vertex updates in the Markov chains. In particular, the blocks should be constructed in such 
a manner that the high degree vertices are somehow "hidden" inside the blocks q 

In our setting, we consider two copies of M. c at states Xq,Yq. The states differ only on the assign- 
ment of the vertex w. The coupling chooses uniformly at random a block B from the set of blocks B 
and updates the colouring of B in both chains. It turns out that the crucial case for proving CQ) is when 
the outer boundary of B is not the same for both chains, i.e. the vertex w is at the outer boundary of the 
block B |j. There, we need to upper bound the expected number of disagreements, the vertices which 
take different colour assignments after the update of colouring. The construction of B should minimize 
the expected number of disagreements. 

To this end, we improve on an idea from |@]. We use the well-known "disagreement percolation" 
coupling construction, 0. The disagreement at the boundary prohibits identical coupling of X\{B) and 
Y\{B). The disagreement percolation assembles the coupling in a stepwise fashion moving away from 
w. Disagreements propagates into B along paths from w. A disagreement at vertex v! S B at distance r 
from w propagates to a neighbour u at distance r + 1 if X\(u) ^ Y\(u). The disagreement percolation 
is dominated by an independent process such that each vertex v € B is disagreeing with probability 

J 1R&3J3 ifA(«)<(H-a)d 

\ 1 otherwise, 

where a > is some small constant and k > 5.5d. The disagreement propagates over the path L that 
start from w with probability at most riueLUuj) Qu- The expected number of disagreements is at most 
the expected number of paths of disagreements that start from w and propagate inside B. 

Intuitively, high degree vertices are expected to have an increased contribution to the number of 
disagreements. Mainly this is due to the following reason: If a high degree vertex is disagreeing, it has 
an increased number of neighbours to propagate the disagreement. However, for typical G(n, d/n) and 
k > ^-d, it turns out that the larger the distance between a high degree vertex from w the less probable is 
for the disagreement to reach it. This, somehow, can balance the increased contribution that high degree 
vertices have. We exploit this observation in the block construction so as to control the overall number 
of disagreeing vertices in the updates. 

To be more concrete, we introduce a weighting schema as follows: Each vertex u, of degree A(tt) 
in G(n, d/n), is assigned weight W(u) such that 

r a + 7)- 1 ifA(«)<(i + t*)d 

W(u) = { (2) 

I d c ■ A (it) otherwise, 

for appropriate real numbers a, 7, c > 0. Given the weights of the vertices, each block B € B should 
satisfy the following two properties: 

(a) B is either a tree or a unicyclic graph 

4 This is the approach that is used in the analysis in both El 1221 . 
3 w ^ B but w is adjacent to vertices in B 



(b) For every path L between a vertex at the outer boundary of B and a high degree vertex inside B it 
should hold that \[ ueL W{u) < 1. 

The property (a), above, guarantees that the updates can be implemented efficiently. The property (b) 
guarantees that the disagreement percolation coupling construction we describe above satisfies (Q}. 

In the weighting schema, observe that the low degree vertices reduce the weight of the path L, while 
the high degree vertices increase it. Restricting the weight of a path between a high degree vertex in the 
block B and a boundary vertex, somehow, guarantees that the high degree vertices are sufficiently far 
from the boundary. I.e. so as to keep the weight of the path low we require a sufficiently large number 
of low degree vertices between the boundary vertex and the high degree vertex. 

Remark. Taking a path L in G(n, d/n) we expect that the product of the weights of its vertices is rather 
low. Mainly, this is due to the fact that only a very small fraction of vertices have large weight. That is, 
for each u <E L it holds that Pr[A(u) > (1 + a)d] < exp (-a 2 d/3) . 

For a path L in G(n, d/n) with \L\ vertices, we show that the upper tail of its weight is heavy. I.e. 
Theorem [8] implies that the probability that the weight of L is greater than 1 is at most exp(— d 0,8 \L\). 

Remark. We follow exactly the same approach to show rapid mixing of Markov chains for the hard core 
model. The partitioning of the vertices of the underlying graph is exactly the same. The only difference 
is the probabilities that the disagreement propagate inside the block B. 

2 Block Creation & Proof of Theorem |4] 

The process of creating the set of blocks B considers three positive real numbers a, 7 and c as parameters, 
i.e. B = B(a, 7, c). Consider a, 7 and c to be fixed numbers, i.e. independent of d. Their exact value 
will be specified later. Consider, also, the graph G(n, d/n) and the weighting scheme in © for its 
vertices. Given the weights, we introduce the concept of "influence". 

Definition 1 (Influence) For a vertex v, let V(v) denote the set of all paths of length at most -£j^ that 
start from v. We call "influence" on the vertex v, denoted as E(v), the following quantity: 

E(v) = max J TT W(v) \ . 

A vertex v is considered to be under no influence if E(v) < 1. The vertices under no influence are 
special for the blocks construction as they are used to specify the boundaries of the blocks. 

Definition 2 (Break-Points & Influence Paths) A vertex v such that E(v) < 1 is called "break-point". 
Also, a path L that does not contain break-points is called "influence path". 



Block Creation: We have two different kinds of blocks. For this, let C denote the set of all cycles of 

1 Inn 
ln 5 d 



length at most 4^r in G(n, d/n). 



1 . For each C G C we have a block which contains every vertex v € C as well as all the vertices that 
are reachable from v through an influence path that does not use vertices of C\{v}. 

2. The remaining blocks are created as follows: Pick a vertex v which does not belong to any block, 
so far. Consider as a new block the vertex v and all the vertices that are reachable from v through 
an influence path. 



Remark. From the construction it is apparent that each break-point is a single vertex block. 

The reader may have noticed that there is no "heavy" path of length at most %^ that starts from a break 
point. However, in Section [T31 we mention that the vertices at the boundaries of each block B should 
not be connected to a high degree vertex inside B with a heavy path. That is from the above definition 
of break-points and the block construction there may be undesirable heavy paths between the boundary 
of a block B and some of its vertices. It turns out that, w.h.p. none of the blocks in B has such a path. 

2.1 Proof of Theorem 1 

For typical instances of G(n,d/n), it turns out that the influence paths we need to consider for the 
creation of the set of blocks B are rather short. Consequently, each of the blocks that are created is either 
a tree or a unicyclic graph. To be more concrete, the situation is as follows: W.h.p. cycles as those in 
the set C are far apart from each other in G(n, d/n), e.g. at distance greater than Inn/ In 2 d. We are 
going to show that all the influence paths we need to consider are of much shorter length, e.g. at most 
In n/ In 5 d. Therefore, no two cycles will be connected and no new cycles will emerge. 

In the statement of the following theorem, we call elementary every path L in G(n, d/n) such that 
there is no cycle shorter than 10 In n/d 2 ' 5 which contains two vertices of L. 



Theorem 5 Consider G(n, d/n) and the fixed numbers 7, c > 0, a € (0, 3/2). Let the set U contain 

Inn 
ln 5 d 



all the elementary paths in G(n, d/n) of length -r^-z that do not have any break-point. It holds that 



PrfU/0] <n~^ lnd . 

The proof of Theorem [5] appears in Section [7] 

Remark. From Theorem [5] and our previous remark about the mutual distance of the cycles in C the 
following observation is direct: W.h.p. the construction of B considers only influence paths that are 
elementary. 

Using Theorem [5] it is direct to prove the following two lemmas. 

Lemma 1 With probability at least 1 — 2n~ 3 < 4 over the instances G(n, d/n), the blocks that are created 
are either trees or unicyclic graphs. 

The proof of Lemma Q] appears in Section [T3. II Also, we have the following lemma holds. 

Lemma 2 With probability at least 1 — n~ 3 ' 4 over the instances of G(n, d/n), the following is true: 
There is a small fixed s > such that the block construction can be made in time 0(n s ). 

The proof of Lemma [2] appears in Section [T3 .3 1 

Remark. Theorem |4]follows as a corollary of the two lemmas above. 

3 Convergence 

3.1 Colouring Model - Proof of Theorem H 

Ergodicity. For Theorem [2] first we need to consider when M. c is ergodic. From J31 we have that the 
Glauber dynamics (and hence the block dynamics we have here) is ergodic probability 1 — o(l) over the 
instances G(n, d/n) when k > d + 2. 



Remark. The proof for ergodicity in B] goes as follows: It is shown that if a graph G has no t-coreQ 
then for all k > t + 2 the Glauber dynamics for fc-colouring yields an ergodic Markov chain. Then the 
authors use the result in G3l . which states that w.h.p. G(n, d/n) has no t-core for t > d. 

For our underlying graph G(n, d/n) we require it to have, additionally, the following three properties: 
(A) It can be coloured with d colours or less. (B) Let B = B(a, 7, c) be such that < 7 < a < 10~ 2 
and c = 10. Then, each of the blocks in B is either a tree or a unicyclic graph. (C) For the values of 
a, 7, c as in (B) the following holds: For each B € B and every u at the outer boundary of B there is no 
path L from u to a high degree vertex v! G B such that n M "eL W(u") > lQ 

Lemma 3 With probability 1 — o(l) the graph G(n, d/n) satisfies all the conditions (A), (B) and (C). 

The proof of Lemma [3] appears in Section [T3.2I 

Let Qf denote the family of graphs such that they have no t-core for t > d and they satisfy conditions 
(A), (B) and (C). We show that the chain M. c has a rapid mixing for any k > ^-d, as long as the 
underlying graph G(n, d/n) £ Q£. We show rapid mixing by using path coupling in [3]. In particular, 
Theorem |2] follows as a corollary of Lemma|3]and the following result. 

Theorem 6 Let ftA c be such that the underlying graph G(n, d/n) € Q^ and k > -^d. Then, the 
following is true: Consider X and Y, two states of A4 C such that H(X,Y) = 1. Let Px,Py be the 
transition probabilities of A4 C given that it starts from X, Y, respectively. There is a coupling vx,Y of 
Px and Py such that for ( X' , Y') distributed as in vxy it holds that 

E UXY [H(X', Y')] < 1 - 0.1225/n, (3) 

where E Ux Y [•] denotes the expectation w.r.t. the distribution vx,Y- 

The proof of Theorem [6] appears in Section @] 

3.2 Hard Core Model - Proof of Theorem [3] 

As opposed to M. c , the chain M.hc is trivially ergodic. Thus, we proceed by showing that it has rapid 
mixing. We require from G(n, d/n) to have the following properties: (A) Let B = B(a, 7, c) such that 
a = eo/2, 7 = £q and c = 10. Each of the blocks in B(a, 7, c) is either a tree or a unicyclic graph. (B) 
For the values of a, 7, c as in (A) the following holds: For each B € B and every u at the outer boundary 
of B there is no path L from u to a high degree vertex v! G B such that n«"eL W(u") > 1. 

Corollary 1 With probability 1 — o(l) the graph G(n, d/n) satisfies both the conditions (A) and (B). 

The proof of Corollary [TJis very similar to the proof Lemma so we omit it. 

Let Qf be family of graphs that satisfy conditions (A) and (B). For A as specified in Theorem |3l 
and underlying graph G(n, d/n) G Qf, we show M.hc has rapid mixing. As in Theorem [2] we use path 
coupling. In particular, Theorem |3]follows as a corollary of the following theorem and Corollary Q] 

Theorem 7 For Mhc such that G(n, d/n) £ Qf and withfugacity X as Theorem\3\specifies, the follow- 
ing is true: Consider X and Y, two states of \hAhc such that H(X, Y) = 1. Let Px, Py be the transition 
probabilities of Mhc given that it starts from X, Y, respectively. There is a coupling vxy of Px and 
Py such that for (X', Y') distributed as in vxy it holds that 

E UXY [H{X\ Y')\ < 1 - (eo - 20e 2 )/n, (4) 

where E Ux Y [-] denotes the expectation w.r.t. the distribution vx,Y and eo = min{e, 1/100}. 

The proof of Theorem [7] appears in Section [5] 



6 A graph without a t-core can have its vertices ordered vi, . . .v n such that vt has fewer than t neighbours in {vi, ...,Vi-i}. 
7 For condition (C) there was a discussion in Section|2] 



4 Proof of Theorem © 

Consider the chain M. c with underlying graph G(n, d/n) E Qf and fe > ^-d. Let X, Y be two states of 
M. c at hamming distance 1 from each other, i.e. H(X, Y) = 1. Furthermore, assume for the vertex w 
that we have X(w) ^ Y(w). We are going to show that there is a coupling vxy of Px and Py that has 
the property stated in ©. For this, we construct a pair (X', Y') distributed as in ux.y by considering the 
two copies of M. c at states X, Y, respectively, and by coupling the next step of these two chains. The 
coupling is such that we choose the same block to update in both chains. 

There are cases where we update the colouring of a single vertex block, i.e. a break-poinlQ There we 
use the notion of maximal coupling transition, as defined in |[2T1 . The update of the colouring of a single 
vertex u, of degree (1 + a)d, is made such that the probability of having X'(u) ^ Y'(u) is minimized. 
This kind of coupling is what we call maximal coupling transition. Let Lx , Ly be the set of colours 
not appearing in the neighbourhood of u under the colouring X and Y, respectively. Since we assume 
k > (1 + a)d, both sets are not empty. Take two mappings fx '■ [0, 1] — > Lx and fy : [0, 1] — > Ly 
such that 

• for each c G Lx, |/ _1 (c)| = r^— r and similarly for Y, 

• {x : fx{%) 7^ fy ( x )} is as small as possible in measure. 

Then, take a uniformly random real U € [0, 1] and choose colour fx(M) for X'(u) and fyiU) for Y'(u). 
Consider now the coupling of the transitions of two chains at state X, Y so as to construction of 
(X', Y'). Let B be the block whose colouring is chosen to be updated in the coupling. Let N = \B\, i.e. 
B is chosen with probability 1/N . We need to consider three cases for the relative position of the block 
B and the disagreeing vertex w. 

Case 1 : The disagreeing vertex w is internal in a block B', i.e. it is not adjacent to any vertex outside 
B'. With probability 1/N in the coupling we have B = B'. Then, we have H(X',Y') = as the 
boundary conditions of B in both chains are identical. Also, with the remaining probability we have 
H(X',Y') = 1. That is 

E UXY [H(X' 1 Y')] = 1-1/N. 

Case 2 : The disagreeing vertex w belongs to a block B' but it is adjacent to some vertices outside 
B' . Also, assume that B' contains more than one vertices. It is easy to check that A(w) < (1 + a)d 
(otherwise condition (C) would be violated). Also, it holds that all the blocks that are adjacent to w are 
single vertex blocks, i.e. they are break-points. 

For this case the following holds: If we don't have B = B' or B adjacent to w, then we have 
H(X', Y')=l. On the other hand, with probability 1/N the coupling chooses to update B' , i.e. we have 
B = B'. Then, as in Case 1, we have H(X',Y') = 0. Also, with probability at most (1 + a)d/N = 
l.Old/N the block B is a break-point adjacent to w. Then, the probability that the break-point gets 
different colour assignment in X' and Y' is at most k _,^ +a \ d = 4^93. as k > ^d. That is, 

E VXY [H{X' ,Y')] <l-0.75/N. 

The update of the colouring of u, a neighbour of w, yields to a disagreement with probability at most 
k-(i+ a )d f° r tne f 0i l owm g reason: We use maximal coupling. Both of the two lists of available colours 
for u contain at least k — (1 + a)d colours and they differ in at most one element. Then it is standard to 
calculate the upper bound for the probability of creating a disagreement on u. 



8 Break points by definition have small degree, i.e. at most (1 + a)d. 



Case 3 : The vertex w is itself a block, i.e. w is a break-point. If we don't have B = B' or B adjacent 
to w, then we have H(X' ,Y')=l. Otherwise, with probability 1/N the coupling chooses to update w, 
i.e. B = {w}. Then we have H(X', Y') = 0. Also, with probability at most (1 + a)d/N = l.Old/N 
the coupling chooses B to be one of the blocks that are adjacent to w. W.l.o.g. assume that B consist of 
more than one vertexQ We need to bound the expected number of disagreements Rb that are generated 
inside B when its colouring is updated. To this end we use the following proposition. 

Proposition 1 For Case 3, there is a coupling such that E[Rb] < °' 8 ^ 88 . 

Using Proposition Q] and the fact that a = 10~ 2 and N < n, we get that 



1 (l + «)rf 0.1225 

N + —n— E \ R b\ < 1 " — 

The theorem follows. 



E UxY [H(X',r)] < l-^+ y -^^E[R B ]<l-^. (5) 



4. 1 Proof of Proposition [T] 

For the proof of the proposition we use an approach similar to one used in JH (the proof of Theorem 1 
(a)). That is, we use the well-known "disagreement percolation" coupling construction in 0. We wish 
to couple X'{B) and Y'(B) as close as possible. However, identical coupling is precluded due to the 
disagreement at the boundary of B. The disagreement percolation assembles the coupling in a stepwise 
fashion moving away from w. Disagreements propagates into B along paths from w. A disagreement 
at vertex v! € B at distance r from w propagates to a neighbour u at distance r + 1 if X'{u) ^ Y'{u). 
Observing that the distributions of X'(u),Y'(u) are invariant under a Glauber dynamics transition, we 
may couple X'(u),Y'(u) by using the maximal coupling. We get that Pr[X'(u) ^ Y'{u)] < A, where 

A= { mtW ifA(u)<(l+a)d 

\ 1 othei-wise. 

To see this consider the following: 

1. the vertex u has at most 2 neighbours which disagree, since B is either a tree or unicyclic graph, 

2. there are at least k — min{/c — 1, A (it)} available colours for u. 

The disagreement percolation is stochastically dominated by an independent process. This implies that 
the following is true. Consider a configuration of the vertices in B such that each vertex it E B is 
either "disagreeing" or "agreeing". Let the product measure V which specifies that the vertex u is 
disagreeing with probability p u = A (where A is as we define above). In this context we call a "path of 
disagreement" every self avoiding path ^j that has all of its vertices disagreeing. 

Let Do be the only vertex in B which is adjacent to w. Consider a "agreeing-disagreeing" configura- 
tion of the vertices in B acquired according to the measure V. In this configuration, let Zj be the number 
of paths of disagreements of length exactly i that start from the vertex vq. It holds that 



E[R B ]<Y,E v [Z l ], 



i>0 

where Ep[-] denotes the expectation taken w.r.t. the measure V. 

Instead of bounding Ep[Zi] w.r.t B, it is equivalent to study the same quantity on the tree T, the tree 
of self-avoiding walks defined as follows: T is rooted at vertex vq, at level i it contains all the vertices in 
B which are reachable from vo through a self-avoiding path inside B of length exactly i. At this point 
we use the following lemma. 



'it will be conceivable from the analysis that this is the worst case assumption. 
10 A path L is called self-avoiding if there are no two Uj, uy £ L such that Uj - 
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Lemma 4 Consider a tree H = (V, E) containing vertices of different degrees. Let li denote the set of 
vertices at the level i in H. For a vertex v £ H let L v denote the simple path connecting it to the root r. 
For L v we define the following weight: 

C p ,s{L v ) = J [ (I{A(m)<s} -P + I{A(«)>s}) , 
u£L v 

where p € [0, 1] and s > is an integer. Assume that for any vertex v such that A(v) > s the following 
condition holds: 

II (^f + dl ° ■ A ^ • Hm) < (1 + 7), (6) 

for some 7 > 0. Assume that d, s are sufficiently large, also d,s>(l+7) while both (s ■ p), (d ■ p) £ 

[ 100(1+7) ' 1+7]- T,Wn ® im P lieS that 

J2 c pA L v)<P-(i-ey vi>o, (7) 

for any 6 < min{l — ps(l + 7), 1 — (ps) 9 ' 10 }. 

The proof of Lemma H] appears in Section |4~2] 

At this point, observe the following: Let L v be a path in T from the root r to some vertex v. 
The probability that L v is a path of disagreement is upper bounded by the quantity C P:S (L V ) where 
P = iirf-n+ = 2 245d ' s = (1 + a )d = LOW. Thus, it holds that 

Ep[Zi] < 2_^Cp,s(L v ), 

where /j is the set of vertices at level i in T. Observe, also, that the quantities p, d and s we consider here 
satisfy the restrictions imposed in Lemma|4]about their (relative) size. Finally, each L v in T satisfies the 
condition in © as the disagreeing vertex w is a break-point and the root of T is adjacent to w. Thus, 
Lemma \4\ implies that Ep[Zi\ < p(l — 6) 1 . So as to bound Ep[Zi] we take the maximum possible value 
for 9, w.r.t. d, s,p. It is direct that 9 < 0.5127. Thus, it holds that 

MZi] < 5^(0.4873)'. 

From the above we get that E[R B ] < 0A f 43 ■ 5^7 = ^f^, as promised. 

4.2 Proof of Lemma |4| 

Proof of Lemma SI Assume that we are given 7, s,p. When there is no danger of confusion, we 
abbreviate C PjS to C. Also let Wi, . . . , w>A(r) denote the children of r in H and let Hj be the subtree of 
H rooted at the child Wj . 

From the condition in © it is direct to see that the root r is of degree at most s. Then, it is trivial to 
verify that (T7J holds, i.e. © is true for i = 0. Consider now that i > 0. Since A(r) < s, it holds that 

A(r) / \ 

Y,C H {L v )=p-Y, E ^(A,) , (8) 

v£k j = l \v€li-i I 
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where the superscripts H, Hj over C denote the tree w.r.t. which we consider the paths and the weights 
C(L V ). Since A(r) < s, a sufficient condition for © to hold is the following one: For each Hj it should 
hold that 

V c H ^L v )< P (i-eyx-, (9) 

^— ' ps 

vek-i 

i.e. if the bound in (© holds, then we can plug it into ® and get ©. 

Observe that in the previous step we used the fact that A(r) < s to "unfold" the condition in fTJ). 
We can repeating the same step on the subtrees of Hj. This gives rise to a weighting over each path L v 
from rtovE l{. We denote this weight as R(L V ). It is not hard to see that for L v we get that 

K(L v )=p(i-ey x (1Y " x J] * x (i {AM <Ai {A(tl)>s} ) , (io) 

Vi ^ y ueM\{v] K ' v ^ y 

where the set M contains all the vertices in L v of degree larger than s while |M\{t>}| = q. Furthermore, 
it is direct to see that (UJ holds as long as for every path L v between r and v S k we have that 

K{L V ) > 1. (11) 

The lemma will follow by showing that (fTTT ) holds. We consider three cases, depending on the degree of 
v, the last vertex in L v and the cardinality of the set M: (A) A(v) < s, (B) A(v) > s and |M\{t;}| = 0, 
(C) A(v) > s and |M\{u}| > 1. We show the following results. 

Claim 1 For the case (A), the condition in ([77]) is true as long as 9 < 1 — ps(l + 7). 
Claim 2 For the case (B), the condition in f liil) is true as long as 6 < 1 — (ps) 9 ' 10 . 
Claim 3 For the case (C), the condition in Alii is true as long as 9 < 1 — ps(l + 7). 
Clearly, condition in (fTTT ) holds for 6 < min{l — ps(l + 7), 1 — (ps) 9 / 10 }. The lemma follows. o 

Proof of ClaimQ} Consider case (A). For each path L v , the condition © implies that 

—±—l[ d ™.A(u)<(l + 1 ). 



If q > 1, we have that 



(I+7V 



■W.) = Mi-«)^i n (AW)" 1 a (1-^(^1^)) 



1 + 7 

u6M\{»} 



i-g y [d 1 V(i+7) J 



v ps(l + 7)y 1 + 7 \P s (l + 7). 

fd 1(, ps(l+7)l 9 

where the last inequality follows from the fact for large d we have that ^ ^r. ^ 1- 

If g = 0, then it is direct to see that 1Z(L V ) = ( -^ J . In any case, the claim follows. 
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Proof of Claim |2j Given the assumptions of the case (B), the condition in Q implies that , * y d 10 
A(v) < 1 + 7. In turn, this implies that i > n \ n , l+ V — 1- Then, we have that 

n{L v ) = P (i-^J-> 



/10 



(psY ~ (ps)*/ 10 \(ps) 9 / 
The claim will follow by showing that (ps) 4 / 10 < p. Observe that we have that ps < 1 and i > 

ln(l+ 7 ) L inatls > 

1 ln(ps) 



■/in 1 nno_^J^22 -L m in , / 1 V 1 /in 

(;ps) 4/10 < (p S )T0 ln(l+ 7 ) TO < (d 10 A( V )) Ht^) ■ (pg)- 1 / 10 

< {d 10 A(v))~To{psy 1/10 [since ps< 1/(1+7)] 

1 ( (ps)- 1 \ 1/10 ^ 

~ d(A(v)y/ 20 \A(v)V 2 J ~ P ' 

The last inequality follows by taking large d, s (A(v) > s). The claim follows. o 

Proof of Claim [3j For the case (C), the condition in © implies that 

- \. TT d 10 ■ A(u) < 1 + 7 . 

K " u£M\{v} 

where q > 1, i.e. M\{v} / 0. Due to our assumptions about d, p, s and 7, we made in the statement of 
Lemma|4]it holds that j^—[d l0 ps(l + j)] q S> 1, for large d. Thus, we have that 

K(L V ) = p{l-8y—!— FT (A(n))~ 1 >-^(l-0) i f— -± r) d 10q 



> _?_ (i_0)*(_i — -) \d io P s(i 

- I + 7 1 J \ps(l + 7 )J 



> 



1 



v ps(l + 7) 
The claim follows. 



since — £— (d 10 p5 (l + 7 ))« » 1 

1 + 7 



5 Proof of Theorem |7] 

Consider the chain A-f^c with underlying graph G(n, d/n) G ^ and A as in the statement of Theorem[3] 
Let X, Y be two states of Mhc at hamming distance 1 from each other, i.e. H(X, Y) = 1. Furthermore, 
assume for the vertex w that we have X(w) 7^ Y(w). We are going to show that there is a coupling 
vx,Y of Px and Py that has the property stated in ©. For this, we construct a pair (X', Y') distributed 
as in vxy by considering the two copies of M. c at states X, Y, respectively, and by coupling the next 
step of these two chains. The coupling is such that we choose the same block to update in both chains. 
Let B be the block whose configuration is chosen to be updated in the coupling. Let N = \B\, i.e. 
B is chosen with probability 1/N. We need to consider three cases for the relative position of the block 
B and the disagreeing vertex w. The cases we consider are exactly the same as those we considered for 
the colouring model in the proof of Theorem [6] 
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Case 1 : The disagreeing vertex w is an internal vertex of a block B', i.e. it is not adjacent to any 
vertex outside B'. It is direct to see with probability 1/N in the coupling B = B'. Then, we have 
H(X' , Y') = as the boundary conditions of B in both chains are identical. Also, with the remaining 
probability we have H(X' , Y') = 1. That is 

E UXY [H(X' 1 Y')] = 1-1/N. 

Case 2 : The disagreeing vertex w belongs to a block B' but it is adjacent to some vertices outside 
B'. Assume, also, that B' contains more than one vertices. It is easy to check that A(w) < (1 + a)d 
(otherwise condition (B) would be violated). Also, it holds that all the blocks that are adjacent to w are 
single vertex blocks, i.e. they are break-points. 

For this case the following holds: If we don't have B = B' or B adjacent to w, then we have 
H(X',Y')=1. Otherwise, with probability 1/N the coupling chooses to update B', i.e. we have B = 
B'. Then, as in Case 1, we have H(X',Y') = 0. Also, with probability at most (1 + a)d/N = 
(1 + ^)d/N the block B is a break-point adjacent to w. Then, the probability that the break-point 
becomes disagreeing is at most j^ < -^ + ^p. Then it is direct to get that 

E UxY [H{X\Y')}<l-l/(2N). 

The update of the spin of u, a neighbour of w, yields to a disagreement with probability at most j^ 
for the following reason: W.l.o.g. assume that X(w) is occupied, i.e. w belongs to the independent set 
and Y(w) is unoccupied. Clearly X(u) cannot be occupied. The only way we can have disagreement is 
when all the neighbours of u, apart from w, in both configurations are unoccupied. Then, Y{u) becomes 
occupied (disagreeing) with probability j^. 

Case 3 : The vertex w is itself a block, i.e. w is a break-point. If we don't have B = B' or B adjacent 
to w, then we have H{X' ,Y')=1. Otherwise, with probability 1/N the coupling chooses to update w, 
i.e. B = {w}. Then we have H(X', Y') = 0. Also, with probability at most (1 + a)d/N the coupling 
chooses B to be one of the blocks that are adjacent to w. W.l.o.g. assume that all these blocks consist of 
more than one vertexr'l We need to bound the expected number of disagreements Rb that are generated 
inside B from the update. To this end we use the following result. 

Proposition 2 For Case 3, there is a coupling such that E[Rb] < (l — feo + 20£q) \ + ]$■ 

Using Proposition |2] and the fact that N < n we get that 

E„ A mx>X)] < i-^ + ^^lsi- 5 ^. 

The theorem follows. 

5.1 Proof of Proposition |2] 

The proof of this proposition is veiy similar to the proof of Proposition Q] for the colouring model. 
That is, we use the "disagreement percolation" coupling construction from citeDisPerc. We consider 
configurations such that each vertex u G B is either "disagreeing" or "agreeing". We let the product 
measure V specify that the vertex u is disagreeing with some appropriate probability p u . 

We consider a "agreeing-disagreeing" configuration of the vertices in B acquired according to the 
measure V. In this configuration, let Zj be the number of path of disagreements of length exactly i that 



"it will be conceivable from the analysis that this is the worst case assumption. 
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start from the vertex vq. The vertex vq € B is the only vertex which is adjacent to the disagreeing vertex 
w. It holds that 

E[R B ) <Y,E v [Zii 

i>0 

where E-p[-] denotes the expectation taken w.r.t. the measure V. 

Compared to the case of colourings, the only difference we have here is the value of the probability 
p v in the disagreement percolation. Observe that for every u S B, the distributions of X'(u),Y'(u) 
are invariant under a Glauber dynamics transition. We can couple X'(u),Y'(u) by using the maximal 
coupling. Working as in the case of a single vertex update case of Theorem [TJ we get that p u = j^. 

Having specified the measure V we work as follows: Instead of bounding E-p[Zi] w.r.t B, it is 
equivalent to study the same quantity on the tree T, the tree of self-avoiding walks we defined in the 
proof of Proposition [Q At this point we use the following lemma. 

Lemma 5 Consider a tree H = (V, E) containing vertices of different degrees. Let li denote the set of 
vertices at the level i in H. For a vertex v G H let L v denote the simple path connecting it to the root r. 
For L v we define the following weight: 

Cp,s\Ev) = P ) 
where p G [0, 1]. Given d,s> 0, for any vertex v such that A(-u) > s the following condition holds: 

n (^f^f + dl ° ■ A{u) ■ ^-0 * (i + 7) ' (i2) 

for some 7 > 0. Assume that d, s are sufficiently large and d, s > (1 + 7), while both (s ■ p), (d ■ p) G 
[iSoTTv I+rl- Then <G2J implies that 

Y, C pA l v) < P ■ (1 " Of Vi>0, (13) 

for any 8 < 1 — ps(l + 7). 

The proof of Lemma [5] appears in Section I5T21 

Let Ao = ^f 1 for fixed eo = min{e, 1/100}, and let d be sufficiently large. Let L v be a path in T 
from the root r to some vertex v G k, where li the vertices at level i in T. The probability that L v is a 
path of disagreement is upper bounded by the quantity C PtS (L v ) where p = j^-, s = (1 + eo/2)d and 
let 7 = €q. Thus, it holds that 

E-p[Zi] < 2 jC PtS (L v ). 

Observe that the quantities p, d and s we consider here satisfy the restrictions imposed in Lemma|5]about 
their (relative) size. Finally, L v satisfies the condition in (fT2l ) as the disagreeing vertex w is a break-point 
and the root of T is adjacent to w. Thus, Lemma [5] implies that E-p[Zi] < p(l — 9) 1 . So as to bound 
Ep[Zi] we take the maximum possible value for 9, w.r.t. d, s,p. According to Lemma|5]it should hold 

that < 1 - ps(l + 7) = 1+6o/ 2 2 ~ 6£ ° . We get that 



El*,] =f<(l-§„ + **)£ + £ 



The proposition follows. 
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5.2 Proof of Lemma |5] 

Proof of LemmaUt When there is no danger of confusion, we abbreviate C P:S to C. Also let w±, . . . , WA(r) 
denote the children of r in H and let #_, be the subtree of H rooted at the child Wj. 

From the condition in (fT2l ) it is direct to see that the root r is of degree at most s. Then, it is trivial to 
verify that (fT3T ) holds, i.e. (TT3T > is true for 2 = 0. Consider now that i > 0. Since A(r) < s, it holds that 

A(r) / \ 

where the superscripts # , i?j over C denote the tree w.r.t. which we consider the paths and the weights 
C(L V ). Since A(r) < s, a sufficient condition for (fPTt to hold is the following one: For each Hj it 
should hold that 



£ c fl] (g<?(i-«)'x-, 



(15) 



i.e. if the bound in (fT31) holds, then we can plug it into (fl4l ) and get (TT3T >. 

Observe that in the previous step we used the fact that A(r) < s to "unfold" the condition in (PT3l) . 
Repeating the same step on the subtrees of Hj gives rise to a weighting over each path L v from r to 
v £ li. Let us denote this weight as R(L V ). It is not hard to see that for L v we get that 



R(L v )=p(l-0) i x(-) tq x J] 7\fr x1 ' 



(16) 

MGAf\{u} 

where M contains all the vertices in L v of degree larger than s while |M\{t>}| = q. Furthermore, it is 
direct to see that (TT3T > holds as long as for every path L v between r and v € k we have that 

n{L v ) > 1. (17) 

It suffice to show that (fTTT ) holds. Observe that, for each path L v , the condition (fT2l) implies that 

? ^-— TT d 10 • A(n) < (1 + 7). 

If q > 1, we have that 






11 / 1 V~ 9 d l °1 
K{L V ) = p(l-e) 1 -—. (A(u)) _1 > (1 - 0)' ( — 

v ; FV ; (ps) l -ip H v v " - v ' \ ps {i + 1 )J i + 7 



«6M\{«} 

i-g y [d 1 V(i+7)] g > ( 



ps(i + 7)y 1 + 7 \ps(i + 7)y 

\d 10 ps(l+y)] q 

where the last inequality follows from the fact that - jq— — — 3> 1 for large d. 

If q = 0, then it is direct to see that 1Z(L V ) = ( ^—^- J . Clearly, in any case it should hold that 
6 < 1 — ps(l + 7). The lemma follows. o 
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6 Tail Bounds 

6.1 A Tail bound for weighting schema 1 

Consider the setting where we assign the vertices of G(n, d/n) weights as in Section [T31 in (O. That is, 
consider a graph G n ^/ n an d fixed numbers a, 7, c > 0. Consider a path L = v 1 , . . . , v\n in G(n, d/n). 
For a vertex Vi € L we let the quantity W(uj) be such that 

W{ Vi ) = { (1 + 7) _1 if AK) < (1 + a)d 

d c • A(vj) otherwise, 

where A(-Uj) is the degree of Uj. Also, let 

c(L) = n w( V ). 

A very useful result for the analysis that follows is the following theorem. 

Theorem 8 Let P be a path in G(n,d/n) with a number of vertices \P\ < 100 Inn. For fixed a € 
(0, 3/2), 7 > 0, c > and any 5 > it holds that 

Pr[C{P) >5}< exp \-d i/5 {\P\ + In 5) 

The proof of Theorem [8] appears in Section [TO] 

6.2 A tail bound for weighting schema 2 

We consider the weighting schema we are going to use in Section [8] That is, consider a path L in 
G(n, d/n) and for each vi 6iwe let Ni denote the set of the vertices outside L which are adjacent to 
Vi, i.e. \Ni\ = A(vj) — 2. For every w G Ni let E out (w) denote the influencel 12 lon vertex w, only from 
paths of length at most In n/d 2 ' 5 that do not use vertices in L. For every v% we let 

Q(vi) = max {E out (w)} . 

w£Ni 



Now, we associate each vertex in L with the following quantity 

C max{l,Q(«j)} 
U(Vi) = \ 1+7 

I max. {I, Q(vi)} ■ d c ■ A(vi) otherwise. 

Also, we let 



if A(vi) < (l + a)d 



AC(L) = H U(ui). 

Vi£L 

Also, we remind the reader that a path L is called elementary if there is no cycle of length less than 
10 In n/d 2 / 5 that contains two distinct vertices of the path. 

Theorem 9 Let P be an elementary path in G(n, d/n) with a number of vertices \P\ < Pr-,- For fixed 
a G (0, 3/2), 7 > 0, c > and any 5 > it holds that 

Pr[AC(P) >S}< 2 exp \-d 7 ' w (|P| +ln«5) 

The proof of Theorem [9] appears in Section [T2] 



"For a definition of the concept influence, see in Section|2] DefinitionQ] 
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7 Proof of Theorem 

For proving Theorem [5] (i.e. in Proposition [3} we need to use the tail bounds from Section [6] 

Let L = vi, . . . , ui£| be an elementary path in G(n, d/re) of length T = t^^t. Also, let £l be the 
probability that L does not have a break point. It holds that 

n \ /a 



Pr[U^0] < £[|U|] < ( )'( _ ) ' SL [by linearity of expectation] 

< (^r) T+1 '■ (i) 7 ■ SL [« (?) < (ne/i)*] 

< nci T • £ L < n L1 • ei . [ asT= i ^] (18) 

Thus, it suffices to bound appropriately ql. Consider the weighting schema in in Section [T31 in (fSJ). 

Definition 3 A vertex V{ E L is called left-break or right-break for L if it has the corresponding prop- 
erty below: 

right-break: There is no path L' 6 V(v) such that Ylv-el/ W(vi) > 1 and L'fll contains Vjforj < i, 
left-break: There is no path L' £ V(v) such that Y\ v gL i W(vi) > 1 and L' n L contains Vjforj > i. 
(For the definition of the set V(u) see in Definitiotfl}) 
Remark. Clearly, Vi £ Lis break point if and only if it is both a left-break and a right-break for L. 

Proposition 3 Consider the elementary path L of length of -r^- in G(n,d/n). Let Yj and Y r be the 
number of left breaks and the number of right breaks for L, respectively. Let the event 

S = "either Y\ or Y r is smaller than 0.9|L| ". 

For a, j, cas in the statement of Theorem\5\and sufficiently large d, it holds that 

Pr[S] < 8n~5TT7 lnd . 

The proof of Proposition [3] appears in Section [8] 

Given that Yj, Y r > 0.9 |L|, it is easy to see that the number of vertices in L which are at the same 
time left-break and right-break is at least 0.8|L|. Thus, we get that 

QL < Pr[S] <8n~^ lnd . 
The theorem follows by plugging the above inequality into (TT8T ). 

8 Proof of Proposition [3] 

It is not hard to see that the random variables Yj and Y r are symmetric, i.e. identically distributed. We 
will focus only Y r and we will show that: 

Pr[Y r < 0.9|Lj] < 4n"5TT^ ln(i . (19) 

Then the proposition will follow by a simple union bound, i.e. 

Pr[S] < Pr[Y r < 0.9|L|] + Pr[Yi < 0.9|L|] < 2Pr[Y r < 0.9|L|], 
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where the last inequality follows from symmetry. 

For each v% S L let N{ denote the set of the vertices outside L which are adjacent to Vi, i.e. \Ni\ = 
A(vi) — 2. For every w £ N{ let E out (w) denote the influence on vertex w, only from paths of length at 
most lnn/d 2 ' 5 that do not use vertices in L. For every Vi let 

Q(vi) = max {E out (w)} . 

Now, we associate each vertex in L with the following quantity 

U{ Vi ) = \ m&A l^ )} if A( Vl ) < (1 + a)d 

I max {1, Q(vi)} ■ d c ■ A(vi) otherwise, 

where a, 7, c are defined in the statement of the proposition. 

Remark. The reader should observe that due to the assumption that L is an elementary path the weight 
of each vertex in L is independent of the weight of the other vertices in L. 

It is straightforward that any vertex V{ such that U{vi) > 1 cannot be a right break for L. Let 

H = {Vj e L\U{v 3 ) > 1}. (20) 

For each Vj G H let IZj = Vj,Vj + \ , . . . , v s be the maximal subpath of L such that for any f £ [j, s] it 

holds l\Cj U (Vr) > 1. 

Lemma 6 A vertex vy £ H is a right break if there is no Vj € H such that j < j' and vy £ IZj. 

The proof of Lemma [6] appears in Section [13.41 From Lemma [6] we get the following: Letting 1Z = 
UjTZj, it holds that Y r = \L\ — \1Z\. The proposition will follow by deriving an appropriate tail bound 
for \1Z\. That is, it suffice to show that 

Pr[\R\ >0.1[L|] < 2n"5T+^ lnd . (21) 



To this end, we work as follows: Let A be the event that "the maximum degree in G(n, d/n) is at most 

lnrf 
37710 



In 2 n". Let, also, F be the event "H has cardinality less than -=7^ In n". For any t > 0, we have that 



Pr[\R\ > \L\/10] < Pr[A c ] + Pr[¥ c ] + Pr[\R\ > |L|/10|A,F] 
< Pr[A c ] + Pr[¥ c ] + Pr[e tm > e t|L|/10 |A,F] 

E [e'l^l |A, Fl 

where in the last inequality we used Mai - kov's inequality. We need to bound Pr[A c ], Pr[¥ c ] and 
E [e*l^l I A, F] . As far as Pr[F c ] is regarded, we use the following lemma. 



Lemma 7 For a, 7, c as in the statement of Proposition^ it holds that 

Pr 






, — o tj— In d 
< n 2 1+ T 



The proof of Lemma [7] appears in Section [87T1 As far as Pr[A c ] is regarded we get that 

Pr[A c ] < nPr[A(v) > In 2 n] < exp (- In 2 n) . (23) 

In the first inequality, above, we use a union bound (over all vertices), while the second inequality 
follows from Chernoff 's bound. As far as E [e*! 7 ^ | A, F] is regarded, we use the following proposition. 
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Proposition 4 For the path L and a, 7, c as in the statement of Proposition\3\the following is true: For 
any < t < d 3 / 5 it holds that 



E 



e i|7e| I A, f| < exp ( 2d 3/w In d ■ In 



. ?) 



The proof of Proposition [4] appears in Section [9] Plugging into (l22l) the bounds from Proposition HJ 
Lemma|7J (T23T ) and setting t = <i 3 / 5 we get (|2TT ). as promised. The proposition follows. 

8.1 Proof of Lemma 

First we are going to compute Pr[t>j £ iJ] = Pr[C/(fj) > 1]. Proposition [5] (in Section [TOl implies that 
£ [l7*(ui)|A] < 2exp (-j^t), for any 1 < t < d 7 / 10 . Then, for £ = d 7 / 10 we have that 

Pr [!/(«<) > 1|A] = Pr[U\vi) > ljA] < E [U^v^A] 
S 2e,p(-^'/'»). 

Let iV = -4^7 In n. If |ij| > iV, then there should be a subset B of vertices in L such that |P| = iV and 
B CH.lt holds that 

( Inn \ / \ iV 

ln5d ) exp( — <i 7/10 



In 2 

< nin 5 d exp 



-^L-d 7 ' w N 
1 + 7 



I n °) 



Jn2 __L lnd _l^_l nd 

< J"Jln 5 d • n 1+ i < n 2 1+ ^ , 



where the last inequality holds for large d. The lemma follows. 

9 Proof of Proposition |U 

Proposition |4]follows as a corollary from the following two lemmas. 

Lemma 8 For a path L and a, 7, c and t as in the statement of Proposition^the following is true: 

e W|A,F 



E 



e f|7ei| |A,F,] 



JT7TU 



Inn 



< [E 

where the event B = u v\ EH" and the set H is defined in ( 1201 ). 
The proof of Lemma [8] appears in Section |9~T1 Also, we have the following result. 
Lemma 9 For a path L and a, 7, c, t as in the statement of Proposition^\the following is true: 



E 



e* |Wl| |A,F,] 



<e 2d . 



The proof of Lemma [9] appears in Section I9T21 
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9.1 Proof of Lemma H] 

We denote the expectation operator E'f-jAjF] as £a,f as well as for Pr[-|A, ¥] we use Pa,f[ - ]- 

Assume that we have j < j' such that TZj n TZy ^ 0. Also, let TZj/ = uy , . . . , w s . Then, it holds 
that IZj = Wj, . . . , w s as IZj , TZy should have the same rightmost vertex (this follows by the definition 
of these sets). Let the set of positive integers 

J = {j £ N\vj G H, and $f < j : IZj C Kj'} . 

It holds that \1Z\ = YljeJ \^j\- Let j\ denote the minimum element in J, j'2 the second smallest and so 
on. Clearly it should hold that ji,J2,J3 < \L\. In what follows, by conditioning that j'2 = \L\ + 1 we 
imply that the set J has at most one element, i.e. j\. 



£? A ,F[exp(t|^|)] 



-Ea,f 


n^ 




_i6J 


|L| + 1 


2 , -^A,F 


s=l 


l^l + l 


2 y ^A,F 


s=l 





JJexp(i|^j|) \h = s 



J] exp(t\Kj\)\j 2 = s 
jeJ\{h} 



Pa,f[J2 = s] 

E[exp(tTZj 1 )\j 2 = s]Pa,f\J2 = s] 



< Ej k ^[exp(t\7Zj 1 1)] • max < 



E 



A,F 



H esxp(t\Tl j \)\J2 = 8 

jeJ\{h} 



Observe that the above factorization holds since the underlying path L we consider is elementary. That 
is there are not two different vertices Vi, Vj G L whose weights are dependent with each other. Assume 
that the above expectation is maximized for s = sq. It holds that 



£7 A ,F[exp(t|ft|)] < EAj[exp(t\1lji\)\ ■ Ea,w 



exp^l^-l)^! 



jeAOi} 



where A\ = "j'2 = sq". Working in the same manner we get that 



£ A , F [exp (t\1Z\)} < J E At¥ [exp (t\1Z ja \) \ n s q=1 A q ], 



where T 



ln ^j In n — 1, since we have assumed that \H \ < -^m In n. Also, the events A Q are defined 



~ 



d 7/10 



in the same manner as the event A\ . 

Consider now the event A q . Observe that TZjS are correlated with each other in the sense that 
\H\ < \L\. Revealing some of them affects only the starting point of the rest, i.e. the events H q=1 A q 
specify the leftmost position that TZj s+1 can start. This is due to the fact that since the underlying path is 
elementary and the weights U(vi)s of the vertices in the path are independent. The further TZj s+1 starts 
the less available vertices it has to extent. E.g. it may be the case that Vj s+1 is the last vertex of the path 
and there are no extra vertices to extend 1Zj s+1 . Considering all the above, it is direct that for any s € J 
the following holds: 

£ AiF [exp(tft is )| n* =1 Aq] < E AiF [exp(t|fti|)|B], 



where the event B ="v\ € H" and the set H is defined in (I201 ). The lemma follows. 
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9.2 Proof of Lemma & 

Let Pr[-|A, F, B] be denoted as P a ,f,b[-]- It holds that 

\L\ \L\ 

e*l^l|A,F,Bl = 2e tr -PA,F,B[|^i|=r]<^e tr -P A ,F,B[|^i|> 



E 



\L\ 



r=l 



< ^e tr -P A , F ,B^C(L lir )>l], 



(24) 



r=l 



where Li r denotes the subpath of L which starts from vertex v\ and ends at the vertex v r . It holds that 

Pr[AC{L 1:T .) > 1,B|A,F] 



Pr[AC(L lt r) > 1|A,F, 



< 



Pr[B|A,F] 
PrlACiL^.) > 1|A,F] 



(25) 



Pr[B|A,F] 
For the nominator we use the tail bound we obtained in Theorem [9] (in Section l6\2l . Then, we get that 

Pr[AC(L 1>r ) > 1|A,F] < 4exp (-d 7/w r) . (26) 

Also, it holds that 

Pr[B|A,F] > Pr[A(ui) > (1 + a)d] > Pr[A(«i) = (1 + a)d] > exp(-2d). (27) 



The last inequality follows from Stirling's approximation and the fact that a € (0, 3/2). Plugging (|26 
and (|27]> into ((25J we get that 



Pr[AC(L hr ) > 1|A,F,B] < exp 
In turn, plugging the above into (l24l we have that 



-d 7/l0 r + 2d 



E 



\M 



e t|%l |A,F,B < 4e M ^exp (t - d 7/10 )r <4e M J]exp -0.8d 7/10 



1 - exp [-0.8d 7 / 10 ] - 



r=l 



[as < t < d 3 / 5 ] 



as e ' 8d °V(l- e a8d0r )<l/4 



The lemma follows. 

10 Proof of Theorem i 

Let the A be the event that "the maximum degree in G(n, d/n) is at most In 2 n". It holds that 

Pr[C(P) >5}< Pr[A c ] + Pr[C(P) > S\A]. 
From the union bound we get that 

Pr[A c ] < nPr [degree(u) > In 2 n] < n exp (—1.2 In 2 n) < exp (— In 2 n) , 
where in the second inequality we use Chernoff 's bound. Consider, now, the quantity 

K{P) = In C(P) = Y, ln W ( u ) = J2 R ( u ^ 



(28) 



(29) 
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where R(u) = In W{u). The proposition will follow by bounding appropriately the probability Pr[K{L) > 
In S] . For a real t > it holds that 

Pr[C(P) > S\A] = Pr[K(P) > ln5\A] = Pr[exp (tK(L)) > d*|A] 

< E [exp (iK"(P)) | A] /&, (30) 

where the last inequality follows from Markov's inequality. For the calculation of E [exp (tK(P)) |A], 
we use the following lemma. 

Lemma 10 Consider the path P = v\ , . . . , v\p\, where \P\ < 100 In n. It holds that 

\P\ 

E[exp(tK(P))\A] < ]lE[exp(tR(vi))\A,Bi], 

i=i 

where the event E>i = "the number of edges between the set Vi = {v\, . . . , Wj-i} and the set V\Vi is 
equal to (i — l)(ln n — 2) and the maximum degree in V{ is at most In n". 

The proof of Lemma [101 appears in Section fT0.il 

Proposition 5 For fixed C > 0, an integer t such that < t < C ■ d A < 5 and any Vi G P the following is 
true: For a, 7, c as in the statement of Theorem\8\it holds that 



E [exp (t ■ R(vi)) [A, Bi] < exp I -y^* ) + 3d^ i+c) exp I -— d 

where the event Bi is defined in the statement ofLemma\W\ 

The proof of Proposition [5] appears in Section fTTJ Using Lemma [TOl and Proposition [5] we get that 

\P\ 



E[(sxp(tK(P))\A] < 



< 



< 



exp ( 2— i J + 3d*( 1+c ) exp (~—d\ 

exp f-^-tlPl") [l + 3d^ exp f-^d + -^-^ 

V 1 + 7 / L V 5 1 + 7 / 

exp (---!—t\P\ + 3^ 1+c ) exp (~d + -^— A \P\] , 

V 1+7 V 5 1+7 / / 



where in the last inequality we used the fact that 1 + x < e x . Taking t = h^-d A ' 5 and sufficiently large 
d we get that 

E[exp(tK(P))\A] < exp (-d^ 5 \P 

The proposition follows by plugging the above inequality into (l30l . 

10.1 Proof of Lemma M 

Proof of LemmaQjB It is elementary to verify that 

ln 2 -l 

E [exp (tK{P)) \A] = Y. E [ ex P ( tK ( P )) ! A > A (^i) = J\ Pr[A(vi) = j\A}. (31) 
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Also, it is direct that 

E [exp (tK(P)) |A, A Vl =j] = E [exp (tR( Vl )) [A, A^ = j] • E 

< E[exp(tR( Vl ))\A,A Vl =j]E 



U'\ 



~[exp(tR(v a ))\A,A Vl 



\P\ 



J exp (tR(v s )) |A, A V1 =ln 2 



;i 



s=2 



The above inequality holds since R(v s )s are increasing with A(v\). Plugging the above inequality into 
(|3TT > we get that 



£ [exp (tK(P)) | A] < £ [exp (t-R(vi)) |A] • £ 



1^1 



] exp (ii^ s )) [A, A( Wl ) = In 2 



/; 



s=2 



The lemma follows by working in the same manner of the remaining vertices in the path. 



1 1 Proof of Proposition |5] 

It is direct that exp (t ■ R(v t )) = W\vi), i.e. E[exp(tR( Vi ))\A, B { ] = S[W^(ui)|A, B*]. 

We distinguish the edges which are incident to v\ into three categories: The first one consists of 
the edges between v% and the vertices in the set {v\, . . . , Vi-2}- Let A int {vi) be the number of these 
edges. The second category consists of the edges between V{ and the vertices not in {v\, . . . , fj+i}. Let 
A ext (vi) be the number of these edges. The third is the edges that Vi used to connect to vi—\ and Vi + \. 
Of course if i = \P\ then there is no uipi+i. I.e. it holds that 

A(v i )<A int (v i ) + A ext (v i ) + 2. 

Conditional on B{, the marginal of A mt (vi) follows the binomial distribution with parameters (i — 
2) (In 2 n — 2), l/(n — i + 1). Also, the marginal of A ext (vi) follows the binomial distribution with 
parameters n — (i + 1), d/n. The joint distribution of A mt (vi), A ext {vi) is such that the degree of Vi is 
at most equal to the maximum degree in G(n, d/n). It holds that 



EiW^v^Bi 



iln 2 n 

E 

3=0 



EiW^v^BuA^ivi) = j]Pr[A mt ( Vi ) = j\A,B 



iln 2 n 



< EiW'iv^A^B^A^ = 0] + V E\W t (v i )\A,B u A! ?=j]Pr[A% t =j\A,B i \, 



3=1 



where we used the fact that Pr[A^ nf = 0|A, Bi] < 1. Also, it holds that 



Pr[Alf = j\A,Bi]< 



Pr[A% t = j\B i ] Pr[Alf = l\Bi] 



Pr[A\Bi] 



< 



Pr[A\Bi] 



(32) 



The last inequality following from the following observation. Given the event Bi, A™' is binomi- 
ally distributed. The function f(j) = Pr[A^ n4 = j\Bi] is decreasing for any j > S[A^ n *|Bj]. As 
i?[A™'|Bi] < In 3 n/n, f(j) is decreasing for every j > 1. That is for any j > 1 it holds that 
Fr[A~* = jl^i] < Pr[A*J* = l\Bi\. Also, it is trivial to show that Pr[A\Bi] > 1/2. 
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Using this fact, the above inequality can be written as follows: 

iln 2 n 

E^ivt^Bi] < E[W t (v i )\A,B i ,A% t = 0] + 2Pr[A™ t = l\B i \. J^ E[W\ Vi )\A, B t , A*f = j] 

, 3 i In 2 n 

< E[W\ Vi )\A, B i> A** = 0] + 200^ £ E[W\ Vi )\A, B t , Ajf = j], (33) 

i=i 

as i < 100 Inn. The proposition will follow by bounding appropriately E[W t (vi)\A, Bi, A^ nt = 0] and 
E[W t (vi))\A, Bi, A^ n * = j]. For this, we provide the following two lemmas. 

Lemma 11 For a, 7, c, t as in the statement of Proposition^ it holds that 

E\W*(vi)\A, B { , A£* = 0] < exp (" j^~*) + 2 ^ (c+1) exp (~yd) ■ 

The proof of Lemma [TTI appears in Section fll.il As far as E[W t (vi)\A, Bi, A^ nt = j] is regarded we 
use the following lemma. 

Lemma 12 For integer j such that 1 < j < In n — 2 and a, 7, c, £ as m f/ie statement of Proposition 
\5\ it holds that 

E[W\vi)\A,Bi,A^ = j] < 5(2d c ) i (lnn) 4t . 

The proof of Lemma [T2l appears in Section [Tl.2l Plugging into (l33l the bounds from Lemmas [TTI [T2l we 
get that 

( lnn )4t+6 



ElW^Vi^Bi] < exp f--2—t\ + 2^ t+c ) exp ( ~A + 1000(2(i) 4 



■/?. 



< 
The proposition follows. 



exp ( ^—t\ + 3d t(c+1) exp (~d) ■ [as t < Cd 4 / 5 ] 



11.1 Proof of Lemma [TTJ 

Proof of LemmaQl} So as to prove the lemma we need the following claim. 

Claim 4 For sufficiently large d, integers q, t such that \q\,\t\ < Cd 4 ' 5 , for any constant C > 0, arcc? 
< a < 3/2 the following is true: 

i=(l+a)<i+g \/\/\ / \ / 

Letting SU = JSfW^^OIA, £*, A™* = 0] we have the following: 

n 

S A = (l + 1 y t Pr[A ext (v l )<(l + a)d-2\A,Bi] + d ct ]T (i + 2) 1 Pr[A ext (vi) = i\A, B^ 

i=(l+a)d-2 
i C t ft / \ / j\ % / i\ n— « 

< (i + 7 )-*+ £ ( ? + 2 )\-j(nJ (^n) [asPr[A^<(l + a)d-2|A,A]<l] 



i=(l+a)d-2 



d t(c+l) 



a 



2 



< (1 + 7)-* + exp — -d [from ClaimH 



Pr[A] " V 4 
< (1 + 7)"* + 2d' (c+1) exp (~^d\ . [as Pr[A] > 1/2] 
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The lemma follows by noting that (1 + 7) < exp I — -p— i ) ■ 

Proof of Claim ffi Let S = Et(i +a ) d+q (i + ^(1) HY (l " 3""*. Using the fact that (7) 
f (£j) , for 1 < i < n, we get that 

i=(l+a)d+g V 7 V / \ / \ 

j=(l+a)d+g-l \ J ' ' \ J / \ / \ / 

It is direct to see that repeating the exactly the same calculation I times, where / < t, and for sufficiently 
large d we get that 

\S=0 / j = (l +a )d+g_Z V J'/ \ J / \ / \ / 

Also, for / = t we see that the above inequality is equivalent to the following 

4-1 
5<d*-e 2 -Pr[i3(n-t,d/n) > (1 + a)d + q - t] • J[ (l + ^V (34) 

s=0 

where the last term expresses the probability that a random variable distributed as in binomial distribu- 
tion with parameters n — t and d/n is at least (1 + a)d + q — t. Using the fact that \q\, \t\ < Cd A ' b and 
standard large deviation results about the binomial distribution, i.e. Corollary 2.3 from lTT4l we get that 

Pr[B(n - t, d/n) > (1 + a)d + q - t] < exp (-a 2 d/4) . (35) 

Also, we have that n* s =o (l + 3) < exp (X)s=o s / d ) ^ ex P ( f (* ~ l )/( 2d ))- Taking large d and using 
the fact that t < Cd 4 / 5 we get that exp (t(t - I)/ (2d)) < exp (C 2 d 3/5 /2). Plugging this result and 
(T35T ) into (1341) we get that S < d 1 exp ( — \d\ , as promised. o 

11.2 Proof of Lemma EG] 

Proof of LemmaSS For S B ,j = E [exp (t ■ R(vi)) [A, £j, A int (^) = j] , where 1 < j < In 4 n - 2, 
we work as follows: 

L J i=max{(l+a)<i-j-2,0} \ / \ / \ / 

< (l + 7 )-i +2( i ct (lnn) 4i |f(l + i/ln 4 n) t ^ (-Y (l - -Y 

< (l+ 7 )- 1 + 2d ct (lnn) 4< ^2 t (' n V- N ) (l - -) [as 1 + i/ln A n < 2 for i < ln 4 n] 



j=0 

< (l + 7 )^ 1 + 2(2d c ) t (lnn) 4i -Pr[S(n,d/n) < In 2 n] 

< 5(2d c ) t (lnn) u . 
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The lemma follows. o 

12 Poof of Theorem [9] 

Let A be the event that "the maximum degree in G(n, d/n) is at most In 2 n". We have that 

Pr[AC(P) >S}< Pr[A c ] + Pr[AC{P) > 5\A], (36) 

from the law of total probability. Also, from the union bound we get that 

Pr[A c ] < nPr [degree (v) > In 2 n] < exp (- In 2 n) . (37) 

where the last inequality follows from Chernoff 's bound. Consider, now, the quantity 

AK{P) = In AC(P) = J2 l nU(vi) = Y, F K), 

v % eP ihgp 

where F(vi) = lnU(vi). The proposition will follow by bounding appropriately the probability Pr[AK(L) > 
In d] . For a real t > it holds that 

Pr[AC(P) > S\A] = Pr[AK(P) > ln5\A] = Prfexp (t ■ AK{L)) > d^A] 

< E[exp(t-AK(P))\A]/5\ (38) 

where the last inequality follows from Markov's inequality. Due to our assumption that P is elementary 
the random variables F(v\), . . . , F(vi P i) are independent with each other. Thus, we have that 

\P\ 
E[exp(t ■ AK(P))\A] = JJ^[exp(i • F( Vi ))\A\. (39) 

i=l 

Proposition 6 Assume that a,j,c are as in the statement of Theorem® For a fixed C > and integer 
t such that < t < Cd 7 ' 10 and any Vi G P the following it true: 

E [exp (t ■ F(vi)) | A] <exp( -— — t ) ( l + 2exp 



1 + 7 J \ \ 2 J J 

The proof of Proposition [6] appears in Section 112.11 Using (l39l and Proposition [6] we get that 

E[exp(t-AK(P))\A] < exp ( --^— t|P| + 2exp [ — J \P\ j . 

For t = 5^^d 7 / 10 and sufficiently large d we get that 

E[exp(t • AK{P))\A] < exp (-d 7/10 \P\) . (40) 

The proposition follows by combining (l40l . (I38T ) (|37T ) and (1361) . 
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12.1 Proof of Proposition |6] 

It is direct that exp (t ■ F(v t )) = C/*(«i), i.e. E[exp(tF(vi))\A] = J5[J7*(« 4 )|A]. 

For a vertex V{ 6 P, let <5(t>j) denote the number of edges between t>; and the vertices outside the 
path P. Observe that the 6(vi) is stochastically dominated by the binomial distribution with parameters, 
n, d/n, e.g. B(n, d/n). By definition it holds that 

In 2 ra-2 

E[tf*(«i)|A] < J]) J B[[/ t (t> i )|A,,5(t; 4 )=j]Pr[ ( 5K)=i|A] 
i=o 

ln 2 n-2 

< J3 ^[^K)|A,<5K) = j] ■E[(^(v i )\A,S(v i ) =j] -PriSivi) = j\A], (41) 

3=0 

where W(vi) is the quantity defined in ©. Observe that the random variables W{vi) and Q(vi) are inde- 
pendent with each other given S(vi). For this reason it holds that ^[[/^(i^IA^j)] = E[W t (vi)\A(vi)] ■ 
E[F l (vi)\A(vi)]. We are going to bound these two expectations by using the following lemmas. 

Lemma 13 For t, a, 7, cas in the statement of Proposition® it holds that 

E[Q\ Vi )\k, 5{ Vi )} <l + 2d- 5{ Vi ) exp (~d 4/5 ] 
The proof of Lemma [TBI appears in Section [T2.2I Plugging (1421 ) into (|4TI ). we get that 

In 2 n 

£[l7*(«i)|A] < £[W*(^)|A] + 2dexp (-d 4 / 5 ) 53 i^[W*(^)|A,^) = j]Pr[S(vi) = j\A]. (43) 

i=o 

Lemma 14 For t,a,j,cas in the statement of Proposition® it holds that 

ln 2 n 

53 jElW^v^A, S(vi) = j]Pr[S(vi) = j\A] < 2d(l + 7 )~* + 2^ c+1 ) +1 exp (-a 2 d/5) . 



(42) 



The proof of Lemma [T4l appears in Section [T2.3I 

Lemma 15 For t,a,j,cas in the statement of Proposition® it holds that 

ElW^v^A] < exp f-—L-t\ + 2d t( - c+ ^ exp (-^-d\ . 

We do not provide any proof for Lemma [T51 as it is the same as the proof of Lemma QT] (which appears 
in Section fTl.il) . Plugging into (l43l) the bounds from Lemmas H~4l [151 we get that 



£[E/*(«i)|A] < 



exp 



TT- 7 T^° +1) °H-T d 



1 + exp 



< exp 



1 



1 + 7 



t) l + 2exp 



d A ^ 



d 4 ^ 



[as t < d 7 / 10 ] 



The proposition follows. 
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12.2 Proof of Lemma [B] 

Proof of Lemma H3t For S(vi) = it is direct that Q(vj) = 1. Thus, the inequality (l42l holds for 
S(vi) = 0. In what follows we consider that 5(vi) > 0. 

Let wi, . . . , wg/ v .\ be the neighbours of Vi outside the path P. Also, for Wi let Si be the set of paths 
of length Inn/d 4 ' 5 that start from Wi but do not use the vertices in P. For 1 < j < 5(vi), let £j jX be 
the event that there is a path L € Sj such that C(L) > x. Observe that the events £j )X are identically 
distributed as j varies. Using Theorem [8] we get the following: 

Claim 5 For 1 < q < 5(vi) and any real x > 1, it holds that 

Pr[£ q ,x] < 2d ■ exp (-d 4/5 {lnx + 1)) . 

The proof of Claim|5]follows after this proof. We have that 

E[Q'(«i)|<S(vi)] = Pr[Q(vi) < l\6( Vi )] + S[Q 4 (t- i )|^ 4 ),g(^) > l]-Pr[Q{ Vi ) > l\S( Vi )] 

< l + J B[Q i (^)l^),gK)>l]-Pr[Q(z; i )>l|5K)], (44) 

as Pr[Q(vi) < l\S(vi)] < 1- Also, it holds that 

S A = EiQ^ViMvilQivi) > 1] • Pr[Q{vi) > l\6( Vi )] 

/cxp[ln n] 
x'PrlQivi) > x\S(vi),Q(vi) > Mdx 

„2 , 



/cxp[ln n\ 
xtPrlQivi) > x\S(vi)]dx. 



The bound exp(ln 2 n) follows from simple calculations which suggest that, given that the maximum 
degree is In 2 n, for any path L € Sj it holds that C(L) < exp (in 2 n) . Note that 

Pr[Q(yi) > x\A( Vi )} = Pr[U S ^Si^\A(vi)] 

< A(vi)Pr[£i !X ] [from the union bound] 

< 2dA( Vi ) exp (-d 4/5 (l + lnx)) . [from ClaimO 
Thus, we get that 

/exp[ln 2 n] . . 

x*expf-d 4/5 lnxjdx 

/exp [In n] / i \ 

exp ( — d A/5 Inxjdx [as < t < Cd 7 / 10 ] 

< 2dA( Vi )exp(-d 4/5 ). 

e ln n _ j4/5 /o 

where in the final derivation we used the fact that J 1 x ' < 1. The lemma follows by plugging 
the above bound for Sa into (l44l ). o 

Proof of Claim |H As in the proof of Lemma [T3l we use the following terminology: Let wi, ■ ■ ■ , w$( v .} 
be the neighbours of vi outside the path P. Also, for wi let Si be the set of paths that start from Wi but 
do not use the vertices in P and they are of length at most In n/d 4 ^ 5 . 
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For the event S\ rX to occur, there should be a path L € Si such that C{L) > x. Let S\ C Sj denote 
the paths in Si of I vertices. It holds that 1 < I < In n/d 4 ' 5 . For every L E Si let II be an indicator 
variable such that 7^ = 1 if C(L) > x, otherwise II = 0. Applying Markov's inequality we get that 



Pr[S ljX ] = Pr 



E'*>° 



LeS* 



<£ 



LdS t 



From Theorem [8] we get the following: For any L € Sf we have that 

Pr[Ji] <exp[-ci 4/5 (j + lna 
Thus, by the linearity of expectation, it holds that 



E 



E^>° 



lnn/d 4 / 5 

= E 0(y-p(-^ 5 (i + ln,) 

lnn/d 4 / 5 

< exp(-d 4/5 lnx) J2 exp(-d i/5 j+jlnd 

i=i 

1 

1 _|_ e -d 4 / 5 +lnd' 



< exp(-d 4 / 5 In x - d 4 / 5 + In d) 



The claim follows by noting that ( 1 + e 



-d 4 / 5 +In< 



<2. 



(45) 



(46) 



[from d46J] 



[as(J)(d/n)'<#'] 



12.3 Proof of Lemma M 

Proof of Lemma[l4t The proof is just a matter of calculations. That is 



In 2 n-1 



S = J2 JE[W t (v l )\A,5(v i )=j}Pr[5(v t )=j\A] 

3=0 



(l+a)d-2 



ln^-2 



< (1 + 7)-* E JPr[6(vi)=j\A] + d ct J2 (J + 2) t+1 Pr[S(vi)=j\ 

3=0 j=(l+a)d-l 



_ t (l+a)d-2 



< 



J2 jPr[A(vi)=j} + 



d t,: 



In 2 -2 



J2 (j + 2) t+1 Pr[A(v t )=j] 



:i+t) 

L J 3=0 L J j=(l+a)d-l 

n In 2 -2 

< 2{l + 1 )- t Y J JPr[B{n,d/n)]+2d tc ^ (j + 2) t+1 Pr[A(t; 4 ) = j] 
3=0 j=(l+a)d-l 

In the last derivation we used the fact that A(uj) is dominated by B(n, d/n) and Pr[A] > 1/2. It is 
direct that the first summation is equal to d. As far as the second summation is regarded we use the 
Claim|4]we also used for the proof of Lemma [TT1 (See Section [Tl.ll for Claim|U). Thus it holds that 

S < 2d(l + 7)"* + 2d*( 1+c ) +1 exp (-a 2 d/5) . 

The lemma follows. o 
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13 Remaining Proofs 

13.1 Proof of Lemma Q] 

Let B\ be the set of blocks created from the cycles in C and let B2 = B\B±. It suffices to show that with 
probability at least 1 — 2n~ 3 / 4 , B\ contains only unicyclic blocks and $2 contains only trees. 

First we focus on the blocks in B\. It suffices to show that with high probability G(n, d/n) is such 
that no two cycles in C are close to each other, e.g. at distance smaller than 10p^. Then, Theorem [5] 
guarantees that the no two cycles will get to the same block. 

If there is a pair of cycles in C at distance less than 10p^, then the following should hold: There is 
a set of vertices 5 of cardinality less than 2-pA such that the number of edges between the vertices in 5 
is at least \S\ + 1. We are going to show that such a set does not exist in G(n, d/n) with probability at 
least 1 — n -3 / 4 . Let D be the event that such a set exists. It holds that 



o Inn o Inn 

ln^ d / \ / fr\ \ / j\ r+1 ln^ d _ / 9 \ r +l / j\ r+1 

n \ 1 U) \ I d\ s~^ ne\ r I re \ I a x 



^ * E * " z sE 



- 



j \r J \r + 1) \n J ~ j-—* \ r J \2(r + 1) J V ^ 

1 \ / \ / \ / r= i \ / \ 

o In n o In n 

kL-^d 1 er d\ f e 2 d\ ed Inn v^ d /e 2 (i 



n\ ,/ne 



as I < 

r / V r 



< :L T T <r2l—Lhr [as r < 2 In n/ ln^d] 



?? 






o inn 

^d_ Inn /fd\ ^d < ^_ 3/4 
" ln 2 d n V 2 / 

where the last two inequalities hold for large fixed d and large n. The above proves that part for B\ . 

So as to show that Bi consists of tree-like blocks we work as follows: Let some B € Bi and let v 
be the vertex we used to created. It is direct that every path that connects v to some vertex in any of 
the blocks in B\ should contain at least one break-point (otherwise v should belong to a block in B{). 
That is, if B contains a cycle then its length should be longer than 4^3. It suffice to show that with 
probability at least 1 — n~ 3//4 , every B G B2 cannot contain a cycle of length 3-p^ or more. 

From Theorem [5] we have that every vertex in B should be within distance at most j^ from v. We 

will show with high probability G ( v, ^Pi ) , the induced subgraph of G(n, d/n) that contains v and all 
the vertices within graph distance -rrh, is either a tree or unicyclic with sufficiently large probability. 

This implies that if there is a cycle in G (v, ^ J its length should not exceed 2^ + 1 < 3^, with 
the same probability. Thus, we prove the part related to £>2. 

It remains to prove that with probability at least 1 — n~ 3 ' 4 , G ( v, -pph J is either tree or unicyclic 
for any v. The proof is by contradiction, e.g. assume that there is a vertex v in the graph G(n, d/n) such 
G ( v, p^ ) contains more than one cycle. This implies that there is a set of vertices S of cardinality less 
than 2 7^7 such that the number of edges between the vertices in S is at least 151 + 1. We have shown 
above that the probability for such a set to exist in G(n, d/n) is at most n~ 3 ' 4 . The lemma follows. 

13.2 Proof of Lemma |3] 

As far as (A) is regarded we use the result from |jT], i.e. with probability 1 — o(l) the chromatic number 
of G(n, d/n) is d/(2 In d). As far as (B) is regarded we use Lemma[T]from Section 

As far as (C) is regarded observe the following: There is no break-point at the outer boundary of 
a block B that is influenced by a vertex within distance lnn/d 2 ' 5 inside B. Let Z, be the number of 
paths L in B such that \L\ = i and n«"eL W(u") > 1. Also, let pi be the probability for L to have 
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n«"Gi W{u") > 1. Using the tail bound in Theorem l8l(Section 16. 1 1 ) we have that 

Pi < exp (-d 4 / 5 i) . (47) 

Also, let Z = ^2 > inn 2j. It is direct to see that if Z = 0, then the condition (C) holds. From Markov's 



"d275 

inequality we have that 



Pr[Z > 0] < £[Z] < Y, ni+1 (-) Pi^ n E ^ 



n 



n 


E 

■ Inn 


exp(- 


i(d 4 / 5 - 


lnd)J 




u" 


~d27i 
2 


5 

OO 

i=0 




j j < 2n" 


d 2 /5 
2 



[from 03 



d 2 /5 

That is, condition (C) is satisfied with probability at least 1 — 2n 2 . Thus, it is direct to see that with 
probability 1 — o(l) G(n, d/n) satisfies all the conditions (A), (B) and (C). The lemma follows. 

13.3 Proof of Lemma |2] 

It is direct to show that once we have the break-points of G(n, d/n) the construction of the blocks can 
be done in polynomial time. The lemma will follow by showing that we can distinguish whether some 
vertex is break point or not in polynomial time. 

For a specific vertex v we need to check all the paths that start from v and are of length at most %^-. 
Working as in the proof of Lemma Q] we have that with probability at least 1 — n~ 3 / 4 for every vertex v 
in G(n, d/n) the neighbourhood we need to check is either a tree or a unicyclic graph. That is, there are 
at most 2 different paths between v and a vertex u at distance at most ^jt- The number of paths we need 
to consider is trivially upper bounded by In. The lemma follows by observing that the computation of 
the weight of a specific path L requires O(L) elementary arithmetic operations, i.e. O(lnn) operations. 

13.4 Proof of Lemma |6] 

Assume a process where we check whether the vertices in L = v\, . . . ,v\^\ are right break-points, 
starting from v\, then ^2,1*3 and so on. Since we are interested in right break points, so as to check Vj 
we don't have to examine the influence of paths getting to this vertex from its neighbor Vj + \. 

Assume that the process is to check Vi £ L. Let Vj G L, with j < i be the last right break point we 
have found so far. If we haven't found any break point yet, for convention we let j = 0. We need to 
show that J}* +1 U(v r ) is bigger or equal to the influence on v\ from paths that reach the vertex either 
from Vi-\ of from the neighbours of Vi outside L, i.e. the vertices in iVj. So as to show this, we use 
induction on the difference i — j. 

We start from i — j = 1. Let j > 0. Then we have that the path with the maximum influence on Vj 
is at most 1. That is the maximum influence of paths reaching to Vi from Vj is W{yi), where W(vi) is 
defined in ©. Also, the maximum influence on Vi from paths that pass through its neighbours in Ni is 
at most max{l, Q(v{)} ■ W(vi). Clearly among the aforementioned paths no path has influence bigger 
than U(vi) (i.e. U(v{) = max{l, Q(vi)}W(vi)). For j = 0, there are no paths reaching to Vi from Vj, 
i.e. we have the paths reaching from the vertices in Ni. Then, it is direct to check that the maximum 
influence on v-i is at most U(vi). 
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Let the induction hypothesis hold for i—j = jo- We are going to show that it holds for i — j = j'o + 1. 
By induction hypothesis we have that for Vj+j the maximum influence from a path that is reaching it 
either from Vj+j -i or from Nj + j is at most flrS+i U(v r ) > 1. For i such that i — j = jo + 1, the 
following holds: the influence of any path that reaches v% through Vi-\ (or Vj + j ) cannot be larger than 
W(iii) Yll^j^i U(v r ) (due to induction hypothesis). Also, there is no path reaching V{ from vertices in 
Ni that has influence larger than U(v%). It is a matter of direct calculations to verify that none of the 
paths we consider has influence greater than Or-i+i U{y r ). The lemma follows. 



Acknowledgement. The author of this work would like to thank Amin Coja-Oghlan for his comments 
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